On Dual Mining: From Patterns to Circumstances, and Back
نویسندگان
چکیده
Previous work on frequent itemset mining has focused on finding all itemsets that are frequent in a specified part of a database. In this paper, we motivate the dual question of finding under what circumstances a given itemset satisfies a pattern of interest (e.g., frequency) in a database. Circumstances form a lattice that generalizes the instance lattice associated with datacube. Exploiting this, we adapt known cube algorithms and propose our own, minCirc, for mining the strongest (e.g., minimal) circumstances under which an itemset satisfies a pattern. Our experiments show minCirc is competitive with the adapted algorithms. We motivate mining queries involving migration between itemset and circumstance lattices and propose the notion of Armstrong Basis as a structure that provides efficient support for such migration queries, as well as a simple algorithm for computing it.
منابع مشابه
یافتن الگوهای مکرّر در قرآن کریم بهکمک روشهای متنکاوی
Quran’s Text differs from any other texts in terms of its exceptional concepts, ideas and subjects. To recognize the valuable implicit patterns through a vast amount of data has lately captured the attention of so many researchers. Text Mining provides the grounds to extract information from texts and it can help us reach our objective in this regard. In recent years, Text Mining on Quran and e...
متن کاملThe Effect of Working Memory Training on Vocabulary Recall and Retention of Iranian EFL Learners: The Case of Dual N-Back Task
This study examined the effect of working memory training on vocabulary recall and retention ofIranian EFL learners using dual N-back task technique. To this end, 50 EFL learners at IslamicAzad University of Shoushtar were randomly assigned to the experimental (n = 25) and control (n= 25) groups. Before the treatment, a vocabulary test was administered to the participants to assessthe participa...
متن کاملHigh Fuzzy Utility Based Frequent Patterns Mining Approach for Mobile Web Services Sequences
Nowadays high fuzzy utility based pattern mining is an emerging topic in data mining. It refers to discover all patterns having a high utility meeting a user-specified minimum high utility threshold. It comprises extracting patterns which are highly accessed in mobile web service sequences. Different from the traditional fuzzy approach, high fuzzy utility mining considers not only counts of mob...
متن کاملInfluence of front row burden on fragmentation, Muckpile shape, Excavator cycle time, and back break in surface Limestone Mines
Front row burden is one of the key parameter to improve the bench blasting results. Improper design of the front row burden can create nuisances in the form of ground vibration, flyrock, back break or it may responsible for breakage of improper fragment size from the rockmass. Therefore, front row burden need to be optimised on the basis of proper scientific assessment. It has been proved that ...
متن کاملData sanitization in association rule mining based on impact factor
Data sanitization is a process that is used to promote the sharing of transactional databases among organizations and businesses, it alleviates concerns for individuals and organizations regarding the disclosure of sensitive patterns. It transforms the source database into a released database so that counterparts cannot discover the sensitive patterns and so data confidentiality is preserved ag...
متن کامل